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ABSTRACT 

Advanced Encryption Standard (AES) is a specification for electronic data encryption. This 
standard has become one of the most widely used encryption method and has been implemented in 
both software and hardware. A high-secure symmetric cryptography algorithm, implementation on 
field-programmable gate array (FPGA). The proposed architecture includes 8-bit data path and five 
main blocks. We design two specified register banks, Key-Register and State-Register, for storing 
the plain text, keys, and intermediate data. This project is simulated by Modelsim 6.4 c and 
synthesized by Xilinx tool. 
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1. Introduction 

An Advanced encryption standard (AES) is one of the secure symmetric cryptography algorithms 
that is widely used indifferent networks and is the main security part in various applications, 
platforms, and networks, such as IoT and in other Internet standards. According to the length of the 
key, AES provides different security levels. Based on the AES algorithm with a 256-bit key is 
secure enough in the quantum era, which means that this algorithm can provide the security 
requirement of different levels of loT applications and protocols. 

With AES both the encryption and the decryption are performed using the same key. This is called 
a symmetric encryption algorithm. Encryption algorithms that use two different keys, a public and a 
private key, are called asymmetric encryption algorithms. An encryption key is simply a binary 
string of data used in the encryption process. Because the same encryption key is used to encrypt 
and decrypt data, it is important to keep the encryption key a secret and to use keys that are hard to 
guess. Some keys are generated by software used for this specific task. Another method is to derive 
a key from a pass phrase. Good encryption systems never use a pass phrase alone as an encryption 
key. Some of the disadvantages of the software implementation of AES are the high latency for 
processing the data and transmission and consumption of more power. 


2.AES Algorithm 

The Pipelined AES Design for Image Encryption & Decryptionare shown using the figure given 
below. The Pipelined algorithm is a symmetric iterated block cipher. The block and key lengths can 
be 128, 192, or 256 bits. The algorithm begins with an Add round key stage followed by 9 rounds 


@2022, JETMS | Impact Factor Value: 5.672 | Page 301 


International Journal of Engineering Technology and Management Sciences 
Website: ijetms.in Issue: 4 Volume No.6 July — 2022 
DOI:10.46647/ijetms.2022.v06i04.0050 ISSN: 2581-4621 


} 


Hn a 


g 
JINpunoy 


Tpunoy 


TUN punoy 


TJN punoy 


Tpunoy 


INpunoy 


ET WE 


\ 


J 
I ae Ciphertext 


of four stages and a tenth round of three stages. This applies for both encryption and decryption 
with the exception that each stage of a round the decryption algorithm is the inverse of its 
counterpart in the encryption algorithm. 


FIGURE 2.1 AES ALGORITHM 

The four stages are as follows: 

> Substitute bytes 

> Shift rows 

> Mix Columns 

> Add Round Key 
The tenth round simply leaves out the Mix Columns stage. The first nine rounds of the decryption 
algorithm consist of the following: 

> Inverse Shift rows 

> Inverse Substitute bytes 

> Inverse Add Round Key 

> Inverse Mix Columns 


3. Methodology 

The main contribution of our work is to design a lightweight AES architecture. To achieve this 
goal, we employ some of the best implementation techniques and design specified blocks according 
to our goals as follows. 

1) In order to reduce the required logic, the Shift-Rows are embedded inside the State-Register. 

2) We optimize Sub-Bytes block and share it with key expansion phase and encryption phase. 
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3) We design an optimized 8-bit block for Mix-Columns with 8-bit input and output that is based 
on the structure of 8-bit data path, which is followed by Add-Round-Key. In comparison to 32-bit 
Mix-columns, it is not necessary to store the results in the registers or increase the data path for 
Key-Register to 32-bit. 


Input-Plaintext Output 


Input-Key 
| | Key-Register 


FIGURE 3.1 BLOCK DIAGRAM OF PROPOSED SYSTEM 


4) To reduce the power consumption of the design, the clock gating technique is applied in 
different parts of the design, which leads to reduce the power consumption. In the different parts of 
the design, we apply the clock gating technique to reduce the dynamic power consumption. The 
clock gating is separately applied on State-Register, the internal registers of Mix-Columns, Key- 
Register & RCON. For instance, the most power consumption is saved during the key expansion 
phase; the clock of State-Register and Mix-Columns is disabled to save power because these two 
blocks are not used in the key expansion phase. 


State Register 

The State-Register consists of sixteen 8-bit registers, each register contains eight flip-flops. State- 
Register is based on a shift-register memory topology, in which, in each cycle, one 8-bit is fed to 
the design and one 8-bit is stored in the State- Register if needed. To reduce the area, we do not 
design a specific block for Shift-Rows. Since Shift-Rows and Sub-Bytes are executed on each byte 
of the input, these two functions can be exchanged without any effect on final results. We 
calculated the final results after the Shift-Rows function and applied it to the State-Register.One of 
the State-Register’s duties is to execute Shift-Rows. Each register of State-Register contains one or 
two inputs that require a multiplexer to select from two inputs. One input receives the data from the 
previous register to execute the encryption phase and pass the data between internal registers The 
other one includes some interconnections between different units of the State- Register to execute 
Shift-Rows. Thus, Shift-Rows isdone by wiring, and this completely removes the logic for Shift- 
Rows step. 
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Key-Register 

The architecture of the proposed Key-Register includes sixteen 8-bit registers, in which each 
register contains eight flip-flops with one or two input(s) and a 2-1 MUX, to store the keys. It also 
contains one 8-bit input and two 8-bit outputs. The reason that we design our Key-Register with 
two outputs is that to expand the keys, two columns of the previous keys are required at the same 
time. Key-Register includes five main operations: first, store the initial keys in the Key-Register; 
second, feed the design by one 8-bit for encryption phase. And store the same key again in the same 
cycle, thus after 16 clock cycles, the keys are stored in the same position before encryption phase. 


Sub-Bytes Optimization 

Sub-Bytes is one of the most critical parts of the AES design in terms of power, area, and latency. 
Our design contains one Sub-bytes, which is used for both the encryption and key expansion. There 
are different methods of implementations of this block. Although the most straight forward way of 
implementation id lookup table (LUT). 


KeyExpansion 

For each round of the algorithm, one 128-bit key is needed. The key in each round is generated 
from the previous round key. Storing all the keys in the design requires a huge memory, which is 
not a proper method for resource-constrained devices. Thus,On-the-fly key expansion is a wise way 
that only requires one 128-bit register. The key expansion phase contains the shift of the last 
column of Key-Register, Sub-Bytes, RCON, and XOR. 


Control Unit 

The control unit includes one 4-bit and 2-bit counters. During the data encryption, the Key-Register 
and State-Register put one element in data pathand store one element according to the shift-register 
topology. 


Clock-Gating Technique 

Clock gating is a popular technique used in many synchronous circuits for reducing dynamic power 
dissipation. Clock gating saves power by adding more logic to a circuit to prune the clock tree. The 
clock -gating is one of the effective logics in RTL and architectural power reduction. Clock gating 
is an effective technique to reduce dynamic power, because individual IP usage varies across 
applications, not all IP cores are used all the time, giving rise to opportunity for reducing the 
unused IP cores power.Bycombining (AND gate) the clock with a gate-control signal, clock gating 
essentially disables the clock to an IP core. When that IP is not used, avoiding power dissipation 
due to unnecessary charging and discharging of the unused circuits. The continuous decrease in the 
minimum feature size of transistors which increase of both device density and design complexity. 


4.Proposed Application 
An Image Cryptography (Encryption & Decryption Based on Images). We are going to 
merge the Matlab & VLSI for this process. We are using Matlab to convert an image into plaintext 
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and then it is encrypted using a key. Then, decrypted using the same key. For the conversion of an 
image into text Matlab is used. And Modelsim is used for the VLSI process. 


The Following process is given in steps: 


S-1: A colour image of 256 x 256 uploaded. 

S-2: Then, the colour image is converted into grey image. 

S-3: Now, the input image is encrypted using the verilog coding. 

S-4: Again, the input image is seen using decryption verilog coding.Thus, the image cryptography 
is enumerated. 


VLSI Process Using Modelsim 


Plain Text —» P — » Cipher Text — mp = —» Plain Text 


Image Converted as $ $ Text File is Converted as 
Text File by Matlab Image using Matlab 


Encryption Decryption 


FIGURE 4.1 PROPOSED APPLICATION DIAGRAM 


5. Results and Discussion 


Synthesization Using Xilinx 9.1 Tool 

In Comparison to the existing system and technologies, the verilog code for an encryption and 
decryption using nano AES(Advanced Encryption Standard) clock gating technique is computed 
which gives the output of device utilisation summary where the usage of Slice Registers, Flip- 
Flops, Look-up Table are reduced. 
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| Device Utilization Summary | E| 
| Slice Logic Utilization [Used Available | Utilization Note(s) | 
Number of Slice Registers 1,066 207,360 1% 
Number used as Flip Flops 1,066 | 
Number of Slice LUTs 3,900 207,360 1% 
Number used as logic 3,882 207,360 1% 
| Number using O6 output only o 3,882 E 
Number used as exdusive route-thru | 18 
Number of route-thrus | 18 
| Number using O6 output only 18 
Number of occupied Slices f 1,670 51,840 3% 
Number of LUT Flip Flop pairs used | 4,155 | 
| Number with an unused Flip Flop | 3,089 4,155 74% 
Number with an unused LUT | 255 4,155 6% 
Number of fully used LUT-FF pairs 811 4,155 19% 
Number of unique control sets 144 
Number of slice register sites lost | 418 207,360 1% 
to control set restrictions 
Number of bonded IOBs | 518 1,200 43% 
IOB Latches | 1 | | 
Number of BlockRAM/FIFO 3 288 1% 


FIGURE 5.1 SUMMARY REPORT 


Schematic View 

It shows the implementation logic of the circuit that how data flows in and out from the circuit. 
RTL is used in the logicdesign phase of the integrated circuit design cycle. An RTL schematic 
description is usually converted to a gate-level description of the circuit by a logic synthesis tool. 
The synthesis results are then used by placement and routing tools to create a physical layout. 
Logic simulation tools may use a design's RTL description to verify its correctness. The Schematic 
View of Encryption & Decryption Module with Clock Gating using Xilinx tool is shown below. 


FIGURE 5.2 ENCRYPTION & DECRYPTION MODULE WITH CLOCK GATING 
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Simulation Using ModelSim 6.4a 

The code is simulated to display forms. To display waveforms, select the design under test in the 
"sim" tab, right-click the mouse, and select "Add > To Wave >All items in region". Alternately, 
bring up a wave window by selecting "View > Wave" in the ModelSim menu. The waveform 
shows occurrence of encryption and decryption of the system. 


00004453454320564c53492050726f6a 
[TEST AES FINAL ftext in 00004453454320 1234567890 12345678 
[TEST AES FINAL fenc_data 90ed3741378b951 tebfebd5731766287 
[TEST_AES FINAL/dec_data ~ | 0004453454320 1234567890 12345678 


[TEST_AES FINAL /MMO/Text_In 

[TEST_AES_FINAL/MMO/Key 

[TEST _AES _FINAL/MMO/Text_Out 

[TEST _AES_FINAL/MMO/Done 

[TEST _AES _FINALMMO/Ck _Gatting 

[TEST_AES_FINAL/MMO/MO]dk 

TEST_AES_FINALMMO/MOjrst 

(TEST_AES_FINAL/MMO/MO/id 

JTEST_AES_FINAL MMO)MO/done 

[TEST _AES_FINAL/MMOMO key 

[TEST _AES_FINAL/MMO/MO text in 

[TEST _AES_FINAL/MMOMO/text out | 10010000111011010011011101000001001101 
[TEST_AES_FINAL/MMOMO/wO 11111110000010010001011011100010 
{TEST_AES_FINAL/MMO/MO/w 1 
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FIGURE 5.4 APPLICATION OUTPUT 


@2022, IJETMS | Impact Factor Value: 5.672 | Page 307 


| International Journal of Engineering Technology and Management Sciences 
Website: ijetms.in Issue: 4 Volume No.6 July — 2022 
DOI:10.46647/ijetms.2022.v06i04.0050 ISSN: 2581-4621 


Image Cryptography using MatLab and Modelsim (Application) 

We are using Matlab to convert an image into plaintext and then it is encrypted using a key. Then, 
decrypted using the same key. For the conversion of an image into text MatLab is used. And 
Modelsim is used for the VLSI process. 


The given image is converted into plain text. And then, encrypted using a key. It is again decrypted 
using same key. Thus, the image is decrypted. 


Comparison Table 

In this comparison table, Area and Delay of the Normal AES Design and Proposed Nano-AES with 
clock-gating is compared. As a result of this comparison, area and delay is reduced in our proposed 
system to a great extent. 


S. No Area Delay 
Method Name Slice Flip LUT Max Gate Path 
Flops Delay Delay Delay 
1 Normal AES Design 7734 | 21207 | 21207 | 160.860ns | 25.302ns | 135.558ns 
2 Proposed Nano AES | 1670 1066 3900 3.405ns 2.923ns 0.482ns 
with Clock Gating 
TABLE 5.1 COMPARISON BETWEEN NORMAL AND PROPOSED AES WITH CLOCK-GATING 


CONCLUSION 

Nano AES is a secure symmetric cryptography algorithm with a high level of security, which is 
widely used in many applications and networks. Thus, AES is a suitable algorithm for tiny IoT 
devices. In this project, we designed a lightweight AES architecture for resource-constrained IoT 
devices.The power consumption of the design was simulated in different timing constraints. To 
make a fair comparison with other similar works, we calculated the normalized power of previous 
works. The power consumption of the proposed design was better than most of the previous works. 
The result of the proposed lightweight AES design is suitable for resource-constrained devices and 
can be supplied by low-power devices. As it is said, AES is a high secure symmetric algorithm. 
The AES with key length 256-bit is secure in the quantum era. In future work, wewill focus on 
designing a postquantum resistance AES for resource constrained IoT devices. Increasing the 
number of rounds and keys has a high impact on area, power, and latency. To mitigate these issues, 
we will focus on an optimized architecture for the design to reduce the power. 
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